N=1:

rank | frequency | n-gram |
---|---|---|
1 | 87056 | -র |
2 | 56899 | -। |
3 | 50970 | -ে |
4 | 37913 | -া |
5 | 22909 | -ি |

N=2:

rank | frequency | n-gram |
---|---|---|
1 | 39671 | -ের |
2 | 17919 | -ার |
3 | 17574 | -কে |
4 | 8582 | -ে। |
5 | 8344 | -তে |

N=3:

rank | frequency | n-gram |
---|---|---|
1 | 7154 | -দের |
2 | 5503 | -নের |
3 | 5079 | -রের |
4 | 3242 | -িতে |
5 | 3157 | -াকে |

N=4:

rank | frequency | n-gram |
---|---|---|
1 | 2275 | -ারের |
2 | 1986 | -গুলো |
3 | 1606 | -ানের |
4 | 1529 | -ীদের |
5 | 1468 | -ভাবে |

N=5:

rank | frequency | n-gram |
---|---|---|
1 | 1051 | -গুলোর |
2 | 880 | -গুলো |
3 | 642 | -ুলোতে |
4 | 543 | -রীদের |
5 | 539 | -ত্তিক |

The tables show the most frequent letter N-grams at the ends of words for N=1…5. The procedure parallels 2.2.5 Most frequent word beginnings, but the aim here is suffix detection rather than prefix detection.
SQL query for N=3:
SELECT @pos:=(@pos+1) AS pos, xx.* FROM (SELECT @pos:=0) r, (SELECT COUNT(*) AS cnt, CONCAT("-", RIGHT(word,3)) AS ngram FROM words WHERE w_id>100 GROUP BY RIGHT(word,3) ORDER BY cnt DESC) xx LIMIT 5;
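
The counting step of the query can also be sketched outside the database. The following Python snippet is only an illustration under stated assumptions: the function name `top_word_endings` and the toy English word list are hypothetical and not part of the corpus tooling. Like `RIGHT(word, N)`, the slice `w[-n:]` returns the whole word when it is shorter than N characters. Python counts Unicode code points, so a Bengali vowel sign such as ে would count as one character of its own, which appears consistent with the 1-gram table above.

```python
from collections import Counter


def top_word_endings(words, n, limit=5):
    """Return (rank, frequency, ending) for the most frequent n-character endings.

    Hypothetical helper: it mirrors GROUP BY RIGHT(word, n) ... ORDER BY cnt DESC
    for an in-memory list of word tokens.
    """
    # w[-n:] keeps the last n code points; shorter words are kept whole,
    # which matches the behaviour of RIGHT(word, n) in the query.
    counts = Counter(w[-n:] for w in words)
    # most_common() sorts by descending count; enumerate supplies the rank column.
    return [
        (rank, cnt, "-" + ending)
        for rank, (ending, cnt) in enumerate(counts.most_common(limit), start=1)
    ]


# Toy data (an assumption for illustration, not taken from the corpus).
sample = ["walking", "talking", "singing", "walked", "talked", "ring"]
print(top_word_endings(sample, 3, limit=2))
# prints [(1, 4, '-ing'), (2, 2, '-ked')]
```

Changing the second argument from 3 to any value between 1 and 5 corresponds to the other tables. The sketch counts every token it is given, so any filtering equivalent to `w_id>100` would have to be applied to the word list beforehand.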
2.2.5 Most frequent word beginnings